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FIELD OF THE INVENTION 



The present invention relates generally to data processing systems and, 
more particularly, to passing serialized versions of objects in a distributed system. 
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: BACKGROUND OF THE INVENTION 

Distributed systems can be made up of various components, including both 
hardware and software. A distributed system (1 ) allows its users to share services 
and resources over a network of many devices; (2) provides programmers with tools 
and programming patterns that allow development of robust, secured distributed 
systems; and (3) simplifies the task of administering the distributed system. 

A distributed system can be implemented using an object-oriented 
; programming language, such as Java™. The Java™ programming language is 
I typically compiled into a platform-independent format, using a bytecode instruction 
set, which can be executed on any platform supporting the Java™ virtual machine, 
i; The Java™ programming language is described in greater detail in The Java™ 
' Language Specification by James Gosling, Bill Joy, and Guy Steele, Addison- 
Wesley, 1996, which is incorporated herein by reference. Java™ and Java-based 
trademarks are trademarks or registered trademarks of Sun Microsystems, Inc. in 
the United States and other countries. 

Distributed systems require that programs running in different address 
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I oriented programming language, such as the Java™ programming language, this 

\ communication can be achieved by passing an "object," which represents an item 

il 

I or instance manipulated by the system, from one program to another. In such a 
ii system, a "class" provides a template for the creation of objects having 
5 i characteristics of that class. The objects in each class share certain characteristics 

\: or attributes determined by the class. A class thus defines the type of an object. 

Objects are typically created dynamically during system operation. Methods 
1; associated with a class are generally invoked on the objects of the same class or 
'■n I subclass. 

10 in In a Java™ distributed system, an object is referred to as being remote 

i when its methods can be invoked from another address space, typically a Java™ 
r virtual machine on a different computer. A remote object is described by one or 

■ more remote interfaces, which are Java™ interfaces that declare the methods of 
f the remote object. Remote Method Invocation (RMI) is used to invoke a method 

15 of a remote interface on a remote object. RMI is explained in, for example, the 

l| Remote Method Invocation Specification, Sun Microsystems, Inc. (1997) 
;! available at 

^ http.7/www.java.sun.com/products/jdk/1.2/docs/guide/rmi/spec/rmiTOC.doc.html, 
; which is incorporated herein by reference. 
20 ;; As part of RMI, Java™ objects are passed between a client and a server. 
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of itself. The serialized representation of the object contains enough information to 
enable the recipient to identify and verify the Java™ class from which the contents 
of the object were saved and to restore the contents to a new instance. A serialized 
object contains two main parts: the object data and a class descriptor. The class 
descriptor describes the content and format of the object data. 

When a serialized object is passed, the object data and the class descriptor 
are transmitted across the network. Although the object data may change over 
time, the class descriptor remains the same. Therefore, multiple remote method 
calls can result in passing the same class descriptor multiple times to a recipient 
who already has a copy of the class descriptor. This is expensive in terms of 
processing time as well as wasted network bandwidth. It is therefore desirable to 
reduce the number of times that a class descriptor is sent to a recipient. 

SUMMARY OF THE INVENTION 

Systems and methods consistent with the present invention reduce the 
number of redundant class descriptors that are sent during remote method calls by 
using serialization contexts. "Serialization contexts" are dictionary objects that map 
a class descriptor to a corresponding integer handle and, on the receiving end, map 
the integer handle back to the class descriptor. When possible, the integer handle, 
rather than the full class descriptor, is passed, saving processing time in RMI calls. 
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A method consistent with the present invention passes a first object and a 
0 second object, both instances of a class, in distinct remote method calls in a 
|5 distributed system. The first object is passed from a sender to a recipient with a 
descriptor of the class and a handle corresponding to the descriptor. The handle 
5 and the descriptor are stored by the recipient. The second object is then passed 

I; from the sender to the recipient with the handle, and the recipient uses the handle 
II to determine the descriptor. 

J3 ' BRIEF DESCRIPTION OF THE DRAWINGS 

10 :1 This invention is pointed out with particularity in the appended claims. The 

above and further advantages of this invention may be better understood by 
referring to the following description taken in conjunction with the accompanying 
u ! drawings, in which: 

4 Fig. 1 depicts a distributed system 100 suitable for practicing methods and 

1 5 systems consistent with the present invention; 

Fig. 2 is a block diagram showing two serialization contexts consistent with 
the present invention; 

Fig. 3 depicts a flow chart of a method for passing objects using serialization 
!. contexts, consistent with the present invention; 
20 u Fig. 4 is a flow chart of the "handshake" between a sender and a recipient 

it 

;■ to agree on a serialization context pair to use; and 
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Fig. 5 is a flow chart showing how the committed flag can be used to provide 
two-way communication. 

DETAILED DESCRIPTION 

A system consistent with the present invention reduces the number of 
redundant class descriptors that are sent during remote method calls by using 
serialization contexts. "Serialization contexts" are dictionary objects that map a 
class descriptor to a corresponding integer handle. When possible, the integer 
handle, rather than the full class descriptor, is passed, saving processing time in 
RMI calls. 

Figure 1 depicts a distributed system 100 suitable for practicing methods 
and systems consistent with the present invention. Distributed system 100 
includes client computer 102 and server computer 104, communicating via 
network 106. Network 106 may be, for example, a local area network, wide area 
network, or the Internet. 

Client computer 102 includes a memory 108, a secondary storage device 
1 10, a central processing unit (CPU) 1 12, an input device 114, and a video 
display 116. The memory 108 includes a Java™ runtime system 118. The 
Java™ runtime system 118 includes a Java™ virtual machine 120, and a Java™ 
remote method invocation (RMI) system 122. The RMI system 122 contains one 
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or more serialization contexts 124. Memory 108 also includes a program 126 
running on client computer 102. 

Server computer 104 includes a memory 128, a secondary storage device 
130, a central processing unit (CPU) 132, an input device 134, and a video 
display 136, The memory 128 includes a Java™ runtime system 138. The 
Java™ runtime system 138 includes a Java™ virtual machine 140, and the 
Java™ remote method invocation (RMI) system 142, The RMI system 142 
contains one or more serialization contexts 144. Memory 128 also includes a 
program 146 running on server computer 104, and one or more objects 148. 

Using RMI, objects can be passed between client computer 102 and 
server computer 104. For example, a program 146 running on client computer 
102 can invoke a method on an object 148 stored in the memory 130 of server 
computer 104. Client computer 102 would use RMI system 122 to convert the 
method call, including an identification of the remote method and any 
parameters, into a byte stream that is sent to server computer 104 via network 
106. Server computer 104, upon receiving the byte stream, would use its RMI 
system to convert the byte stream into executable bytecode and initiate the 
invocation of the method on the remote object. If the method results in a return 
value, server computer 104 would convert the return value to a byte stream using 
its RMI system, and transmit the byte stream to the client computer 102. 
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The byte streams contain serialized versions of Java™ objects, e.g. 
parameters or return values. A serialized object contains two main parts: the 
object data and a class descriptor. The class descriptor describes the content 
; and format of the object data. Object serialization is explained in, for example, 
5 the Java™ Object Serialization Specification, available at 

http://java.sun.eom/products/jdk/1.3/docs/guide/serialization/spec/serialTOC.doc. 
html, which is incorporated herein by reference. 

Within a single remote method call, a class descriptor is sent with the first 
; object of that type that is serialized, subsequent objects of that type in the same 
10 remote method call refer to the class descriptor with a "back reference" (i.e., an 

m integer handle). 

; Serialization Contexts 

In a serialized object, the class descriptor provides the full name of the class 
% ; and its serialization ID, which uniquely identifies the class. The serialization ID is 
15 : a64-bithashoftheclassnameJnterfaceclassnames, methods, and fields. Each 

r class descriptor is an instance of the Java™ class ObjectStreamClass, defined as 
^ follows: 

public class ObjectStreamClass 
{ 

20 public static ObjectStreamClass lookup(Class cl); 

public String getNameQ; 

public Class forCiassQ; 

public ObjectStreamField[] getFleldsQ; 
LAW orncHs publlc long getSerialVersionUiD(); 

m"c"' i PL-blic string toStringO; 
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! Serialization contexts can be used to pass the class descriptors of serialized 
1] objects. As explained above, serialization contexts are dictionary objects that map 
;! a class descriptor to a corresponding integer handle. When possible, the integer 
5 :| handle, rather than the full class descriptor, is passed, saving processing time in 

i; RMI calls. 

Figure 2 is a block diagram showing serialization contexts 124 and 144 in 
ij more detail, consistent with the present invention. Each serialization context is 
3 I; maintained as a pair of tables: one for outgoing handles, e.g., 202 or 206, and one 

10 ^ |i for incoming handles, e.g., 204 or 208. Outgoing handles are used when a program 

J ij running on the computer acts as a sender (e.g., makes a remote call or sends return 

Y ; values). Incoming handles are used when a program running on the computer acts 

o y as a recipient (e.g., receives a remote call or receives return values). In this way, 
□ a program 126 running on the client computer and a program 146 running on the 

15 5; i; server computer can each act as a sender or recipient. Both the RMI system of the 
\i client computer and the RMI system of the server computer maintain an outgoing 
Ij handle table and an incoming handle table. 

RMI system 122 of client computer 102 contains serialization context 124, 
jj which consists of outgoing handle table 202 and incoming handle table 204, and 
20 !} RMI system 142 of server computer 104 contains serialization context 144, which 

ij consists of outgoing handle table 206 and incoming handle table 208. Each 



LAW OFFICEIS i; 

r^Scarrct?^' I incoming handle table has one or more entries including a handle and a class 

i DUNMER,L.L.P. |i 

OO I STREET, N. W. \\ ^ 

HINGTON^DC £0005 \\ " " 

202-408-4000 \\ 



Attorney Docket No. 06502.0267 

descriptor. Each outgoing handle table has one or more entries, the entries 
including a flag, a handle, and a class descriptor. The flag in each outgoing handle 
I; table entry is a boolean value indicating whether the corresponding handle/class 
descriptor pair is "committed." If a handle/class descriptor pair in an outgoing 
5 handle table is committed, it is known to be saved in the corresponding incoming 

handle table of the serialization context pair. For example, if the committed flag in 
an entry in outgoing handle table 202 is true, then the corresponding class 
descriptor/handle pair has been stored in incoming handle table 208. If the 
committed flag is false, incoming handle table 208 may or may not contain the 
10 i ll corresponding class descriptor/handle pair. The use of the committed flag will be 

m described in further detail below with reference to figure 5. 

^ Figure 3 depicts a flow chart of a method for passing objects using 

serialization contexts, consistent witli the present invention. First, tiie sender (e.g., 
S ^ program 126 running on client computer 102) and the recipient (e.g., program 146 

1 5m. : running on server computer 1 04) agree on a serialization context to use during the 

method call (step 302). As a result of this agreement, or "handshake," the sender 
\ will use serialization context 124 and the recipient will use serialization context 144, 
; as shown in figure 2. The "handshake" process is explained below, with reference 
to figure 4. 

20 When the sender wishes to send a class descriptor to the recipient, the 

i sender checks to see if the descriptor is already defined in the outgoing handle 
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j true, as detailed with reference to one embodiment in Figure 5 below, the sender 

j 

j retrieves the handle corresponding to the class descriptor from the outgoing handle 
I table 202 of serialization context 124, and sends the handle rather than the full 
! class descriptor to the recipient (step 306). The recipient then uses the handle to 
I look up the class descriptor in the incoming handle table 208 of serialization context 
i 144. If the class descriptor that the sender wishes to send is not in the outgoing 
1 handle table 202 of serialization context 124, the sender sends both the class 
I descriptor and a new handle (step 31 0). For subsequent calls, the sender can send 
I just the handle to the recipient. 
I Handshake 

Figure 4 is a flow chart of the "handshake" between a sender and a recipient 
i to agree on a serialization context pair to use. When a connection between the 
i sender and the recipient is established, e.g., when a new RMI session begins, the 
\ sender and recipient "handshake" to agree on a serialization context pair to use, as 
I stated in step 302 of figure 3 above. Each pair of serialization contexts, e.g., 

\ serialization contexts 124 and 144, is identified by a globally unique context ID. This 

■I 

I context ID is used to perform the handshake. First, the sender determines whether 
1 one of the sender's serialization contexts is associated with a serialization context 

i of the recipient (step 402). If so, the sender transmits the context ID for that 

i 

j serialization context pair to the recipient (step 404). Otherwise, the sender 

i 
\ 

\ transmits a null ID to the recipient (step 406). If the recipient receives a non-null 
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context (step 410). If it does, the recipient echoes the context ID back to the sender 
(step 412). 

If the recipient does not have the serialization context corresponding to the 
context ID received, or if the recipient receives a null ID, the recipient creates a new 
serialization context (step 414) and sends the new context ID to the sender (step 
416). The sender then knows that, if it receives the same context ID that it sent, it 
can use the proposed serialization context. Otherwise, the sender should create a 
new serialization context with the new context ID and use that instead. 
Using committed flags to enhance two-way communications 

Figure 5 is a flow chart showing how the committed flag can be used to 
enhance two-way communication by ensuring that handles are sent without class 
descriptor definitions only when the receiving end is known to have previously 
received a definition (i.e., a class descriptor) corresponding to the handle. As 
described in step 306 of figure 3 above, when the sender determines that a class 
descriptor is already defined in outgoing handle table 202, the sender can send the 
corresponding handle, rather than the full class descriptor, to the recipient. 
However, before sending the handle, the sender can use the committed flag to 
ensure that the sender has an entry containing the class descriptor/handle pair in 
incoming handle table 208. 

To use the committed flag in this way, the sender first checks to see if the 
class descriptor is in the outgoing handle table 202 of serialization context 124 (step 
^S^A^? i s®"^®"" ^^^^^^ the value of the corresponding committed flag 

-12- 
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M 

I (Step 504). If the committed flag is true, the sender can send the handle, knowing 

II that the class descriptor/handle pair is stored in the recipient's incoming handle 
11 table 208 (step 506). 

If the class descriptor is not in the outgoing handle table 202 of serialization 
:| context 124, the sender creates a new entry, with a new handle and a committed 
i flag set to false, in outgoing handle table 202 (step 508) and sends the new handle 
ji and class descriptor to the recipient (step 510). The recipient stores the new class 
li descriptor/handle pair in incoming handle table 208 (step 512). The sender would 
;i also send both the class descriptor and the handle to the recipient if the class 
11 descriptor is in outgoing handle table 202, but the committed flag is false (steps 510 
i; & 512). The recipient would simply discard any duplicate handle/class descriptor 
:l pairs received, 
f j; Handle acknowledoment - arguments 

+ To rely on the committed flags as described above, there must be a way to 

15 p update the flags in both the sender's outgoing handle table and the recipient's 

J outgoing handle table. This updating can be done using the arguments sent from 
li the sender to the recipient and the return values returned from the recipient to the 
ii sender. 

When an argument, including data and a class descriptor/handle pair, is sent 
20 ;| from a sender (e.g., program 126 running on client computer 102) to a recipient 

(e.g., program 146 running on server computer 104), the recipient uses the class 
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As part of this process, the recipient enters any new handle/class descriptor pairs 
into the incoming handle table 208. In one ennbodiment, this updating occurs before 
the nnethod call can return successfully to the client. Therefore, when the remote 
method call is successfully returned to the original sender, the handle/class 
5 descriptor pair is implicitly acknowledged, and the sender can set the corresponding 

committed flag in the outgoing handle table 202 to true. 
■ Handle acknowledgment - return values 

Class descriptors used by the recipient (e.g., program 126 running on server 
computer 104) to send return values to the sender (e.g., program 146 running on 
1 0 i 15 client computer 1 02) require an explicit acknowledgment. The recipient has no way 

m of knowing whether the sender successfully stored the handle/class descriptor pair 

sent with the return value in the incoming handle table 204. To acknowledge that 
i;; ; the incoming handle table 204 has been updated, the sender sends an explicit 
+ acknowledgment of its successful receipt of the handle/class descriptor pair with its 

1 5 " :i next call to the recipient. The acknowledgment can be delayed in this way because 
ii the recipient only needs the acknowledgment if there are future communications 
\\ between the sender and the recipient, 
i; Garbage collection 

Serialization contexts can get quite large. If a pair of tables is no longer 
20 needed, memory space can be saved by deleting the tables. Preferably, this 

"garbage collection" is made possible by using the globally unique ID codes 
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6 contexts to their unique ID codes can be maintained by, for example, RMI system 

i 124 or RMI system 144. Space in this table Is "leased," meaning that after a set 

ii amount of time has passed, a serialization context/unique ID code pairing is deleted 
i: from the table. Each time a serialization context is accessed by an object, e.g., a 

5 i! program running on client computer 1 02 or server computer 1 04, the lease time is 

:l reset. Therefore, serialization contexts will automatically be available for a set 
|i amount of time between uses. After the set amount of time expires and a 
I serialization context is deleted, a new table is created when a client wishes to 
;=y :l communicate with the server, as described in Figure 5. 
1 0 Other embodiments of the invention will be apparent to those skilled In the 

S li art from consideration of the specification and practice of the invention disclosed 
T i herein. It is intended that the specification and examples be considered as 

M ii exemplary only, with a true scope and spirit of the invention being indicated by the 
f 11 following claims. 
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